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Please allow entry of the Sequence Listing submitted herewith. 

IN THE SPECIFICATION 

Please amend and replace the paragraph, at page 9, lines 20-26 of Table 1, as follows: 

-(The S4 H2N-terminal amino acid sequence determined using the automated protein 
sequenator is shown in blocks as the mature protein sequence. Residues that were questionable 
in the sequence are indicated by brackets. The DNA (SEQ ID NO:l) and predicted amino acid 
sequences (SEQ ID N0:2) are shown. Possible initiation codons are indicated by f-Met. A 
putative proteolytic cleavage site is indicated by *. The mature protein sequence as indicated is 
SEQ ID N0:4. The oligonucleotide probe sequence is shown in the block labeled probe 21D3 
(SEQ ID N0:3). The abbreviations used are: P = G or A; Y = T or C; N = A, C, G, or L.f 

Please amend and replace the paragraph, at page 14, line 35 through page 15, line 12, as 
follows: _ 

■^Because of the high C+ G content of B. pertussis DNA, it was necessary to use both of 
the above mentioned methods with a combination of 8% and 20% polyacrylamide- 8 M urea gels 
for sequence analysis. Each nucleotide has been sequenced in both directions on average of 4.13 
times. The final consensus sequence of the sense strand is shown in Table 2. It is noted that the 
sequence of the S4 subunit gene has been included in this table for completeness since this 
sequence lies in the middle of the structural gene sequence presented in Table 2. The complete 
nucleotide sequence of Pertussis Toxin gene (SEQ ID N0:5) and deduced amino acid sequence 
(SEQ ID N0s:6-1 1) are presented in Table 2. The entire sequence contains about 62.2% C+G 
with about 19.6% A, 33.8% C, 28.4% G and 18.2% T in the sense strand, wherein A, T, C and G 
represent the nucleotides adenine, thymine, cytosine and guanine, respectively.AJ 
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Please amend and replace the Table, on page 21, as follows: 

^ . ^ 




Table 4 



Comparison of Two Homologous Regions in ADP-ribosylating subunits of Pertussis, Cholera, 

and E. Coli Heat Labile Toxins 



Region 1 


SEQDD 
NO: 








Pertussis SI subunit 


12 


(8) 


Tyr Arg Tyr Asp Ser Arg Pro Pro 


(15) 


Cholera* A subunit 


13 


(6) 


Tyr Arg Ala Asp Ser Arg Pro Pro 


(13) 


E. coli'* HLT A Subunit 


14 


(6) 


Tyr Arg Ala Asp Ser Arg Pro Pro 


(13) 


Region 2 










Pertussis SI subunit 


15 


(51) 


Val Ser Thr Ser Ser Ser Arg Arg 


(58) 


Cholera^ A subunit 


16 


(60) 


Val Ser Thr Ser lie Ser Leu Arg 


(67) 


E. coli* HLT A Subunit 


17 


(50) 


Val Ser Thr Ser Leu Ser Leu Arg 


(67) 



The numbers in parentheses refer to the amino acid position in the mature proteins. 
^Data from Yamamoto, et al. FEES Letter 169:241. 1983 

HLT - Heat Labile Toxin p 



Please amend and replace the paragraph, at page 26, line 22 through page 27, line 25, as 
follows: — 

[since all pertussis toxin subunits are closely linked and probably expressed in a 

very precise ratio, it is possible that they are arranged in a polycistronic operon. A polycistronic 
arrangement for the subunit cistrons also has been described for other bacterial toxins bearing 
similar enzymatic functions, such as diphtheria, cholera, and E. coli heat labile toxins. 
Therefore, the flanking regions were analyzed for the presence of transcriptional signals. In the 
5' flanking region, starting at position 469, the sequence TAAAATA (SEQ ID NO: 18) was 
found, which six of the seven nucleotides found in the ideal TATAATA (SEQ ID NO: 19) 
Pribnow or -10 box. An identical sequence can be found in several other bacterial promoters, 
including the lambda L57 promoter. Given the fact that most transcripts start as a purine residue 
about 5-7 nucleotides downstream from the Pribnow box, the transcriptional start site was 
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tentatively located at the adenine residue at position 482. This residue is located in the sequence 
CAT, often found at transcriptional start sites. Upstream from the proposed -10 box, the 
sequence CTGACC (SEQ ID NO:20) starts at position 442, This sequence matches four of the 
six nucleotides found in the ideal E. coli -35 box TTGACA (SEQ ID N0:21). The mismatching 
nucleotides in the proposed pertussis toxin -35 box are the two end nucleotides, of which the 3' 
residue is the less important nucleotide in the E. coh -35 consensus box. A replacement of the T 
by a C in the first position of the consensus sequence can also be found in several E. coli 
promoters. The distance between the two proposed promoter boxes is 21 nucleotides, a distance 
of the same length has been found in the galPl promoter and in several plasmid promoters. The 
proposed -35 box is immediately preceded by two overlapping short inverted repeats with 
calculated free energies of -15,6 kcal and -8.6 kcal, respectively. Inverted repeats can also be 
found at the 5 '-end of the cholera toxin promoter. In both cases, they may be involved in 
positive regulation of the toxin promoters. None of the ORFs assigned to the other subunit is 
closely preceded by a similar promoter-like structure. However, a different promoter-like- 
structure was found associated with the S4 subunit ORF. - 

Please amend and replace the paragraph, at page 28, line 15 through page 29, line 12, as 
follows: 

Additionally, the 5 '-flanking region of each cistron was also examined for the presence of 
ribosomal binding sites. Neither the ribosomal binding sequences for B. pertussis genes, nor the 
3'-end sequence of the 16S rRNA are known. Therefore, the flanking regions could be 
compared with only the ribosomal binding sequences of heterologous procaryotic organisms 
represented by the Shine-Dalgamo sequence. Preceding the SI initiation codon, the sequence 
GGGGAAG (SEQ ID NO:22) was found starting at position 495. This sequence shares four out 
of seven nucleotides with ideal Shine-Dalgamo sequence AAGGAGG (SEQ ID NO:23). The 
two first mismatching nucleotides in the pertussis toxin gene would not destabilize the 
hybridization to the 3'-end of the E. coli 16 S rRNA. This putative ribosomal binding site is 
close enough to the initiation codon for SI to be functional in E. coli . Another possible Shine- 
Dalgamo sequence overiaps the first one and also matches four out of seven nucleotides to the 
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